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Maximum-Likelihood Sequence Detector for 
Dynamic Mode High Density Probe Storage 

Naveen Kumar, Pranav Agarwal, Aditya Ramamoorthy and Murti V. Salapaka 

Abstract 

There is an increasing need for high density data storage devices driven by the increased demand 
of consumer electronics. In this work, we consider a data storage system that operates by encoding 
information as topographic profiles on a polymer medium. A cantilever probe with a sharp tip (few 
nm radius) is used to create and sense the presence of topographic profiles, resulting in a density of 
few Tb per in.^. The prevalent mode of using the cantilever probe is the static mode that is harsh on 
the probe and the media. In this article, the high quality factor dynamic mode operation, that is less 
harsh on the media and the probe, is analyzed. The read operation is modeled as a communication 
channel which incorporates system memory due to inter-symbol interference and the cantilever state. 
We demonstrate an appropriate level of abstraction of this complex nanoscale system that obviates the 
need for an involved physical model. Next, a solution to the maximum likelihood sequence detection 
problem based on the Viterbi algorithm is devised. Experimental and simulation results demonstrate 
that the performance of this detector is several orders of magnitude better than the performance of other 
existing schemes. 

I. Introduction 

Present day high density storage devices are primarily based on magnetic, optical and solid 
state technologies. Advanced signal processing and detection techniques have played an important 
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role in the design of all data storage systems (21, iBl, 01, m, [m, M^. Indeed tech- 
niques such as partial-response max-likelihood BTI . [|24ll were responsible for significantly 
improving magnetic disk technology. 

In this work, we consider a promising high density storage methodology which utilizes a sharp 
tip at the end of a micro cantilever probe to create, remove and read indentations (see ^22^). The 
presence/absence of an indentation represents a bit of information. The main advantage of this 
method is the significantly higher areal densities compared to conventional technologies that are 
possible. Recently, experimentally achieved tip radii near 5 nm on a micro-cantilever were used 
to create areal densities close to 1 Tb/in^ [22] . 

A particular realization of a probe based storage device that uses an array of cantilevers, along 
with the static mode operation is provided in IS]. However, there are fundamental drawbacks of 
this technique. In the static mode operation, the cantilever is in contact with media throughout 
the read operation which results in large vertical and lateral forces on the media and the tip. 
Moreover, significant information content is present in the low frequency region of the cantilever 
deflection and it can be shown experimentally that the system gain at low frequency is very 
small. Therefore, in order to overcome the measurement noise at the output, the interaction 
force between the tip and the medium has to be large. This degrades the medium and the probe 
over time, resulting in reduced device lifetime. 

The problem of tip and media wear can be partly addressed by using the dynamic mode 
operation; particularly when a cantilever with a high quality factor is employed. In the dynamic 
mode operation, the cantilever is forced sinusoidally using a dither piezo. The oscillating can- 
tilever gently taps the medium and thus the lateral forces are reduced which decreases the media 
wear ll25l . Using cantilever probes that have high quality factors leads to high resolution, since 
the effect of a topographic change on the medium on the oscillating cantilever lasts much longer 
(approximately Q cantilever oscillation cycles, where each cycle is l//o seconds long and Q 
and /o is the quality factor and the resonant frequency of the cantilever respectively). Moreover, 
the SNR improves as y/Q [|23l . However, this also results in severe inter-symbol-interference, 
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unless the topographic changes are spaced far apart. Spacing the changes far apart is undesirable 
from the storage viewpoint as it implies lower areal density. Another issue is that the cantilever 
exhibits complicated nonlinear dynamics. For example, if there is a sequence of hard hits on 
the media, then the next hit results in a milder response, i.e., the cantilever itself has inherent 
memory, that cannot be modeled as ISI. Conventional dynamic mode methods described in lUTl . 
that utilize high-Q cantilevers are not suitable for data storage applications. This is primarily 
because they are unable to deal with ISI and the nonlinear channel characteristics. The current 
techniques can be considered analogous to peak detection techniques in magnetic storage lfT4l . 

In this work we demonstrate that these issues can be addressed by modeling the dynamic 
mode operation as a communication system and developing high performance detectors for 
it. Note that corresponding activities have been undertaken in the past for technologies such 
as magnetic and optical storage lfT3l . e.g., in magnetic storage, PRML techniques, resulted in 
tremendous improvements. In our work, the main issues are, (a) developing a model for the 
cantilever dynamics that predicts essential experimental features and remains tractable for data 
storage purposes, and (b) designing high-performance detectors for this model, that allow the 
usage of high quality cantilevers, without sacrificing areal density. As discussed in the sequel, 
several concepts such as Markovian modeling of the cantilever dynamics and Viterbi detection 
in the presence of noise with memory |[T1, play a key role in our approach. 
Main Contributions: In this article, a dynamic mode read operation is researched where the 
probe is oscillated and the media information is modulated on the cantilever probe's oscillations. 
It is demonstrated that an appropriate level of abstraction is possible that obviates the need 
for an involved physical model. The read operation is modeled as a communication channel 
which incorporates the system memory due to inter-symbol interference and the cantilever 
state that can be identified using training data. Using the identified model, a solution to the 
maximum likelihood sequence detection problem based on the Viterbi algorithm is devised. 
Experimental and simulation results which corroborate the analysis of the detector, demonstrate 
that the performance of this detector is several orders of magnitude better than the performance 

3 



of other existing schemes and confirm performance gains that can render the dynamic mode 
operation feasible for high density data storage purposes. 

Our work will motivate research for fabrication of prototypes that are massively parallel and 
employ high quality cantilevers (such as those used with the static mode If22l and intermittent 
contact dynamic mode but with low-Q [5]). In current prototypes, the cantilever detection is 
integrated into the cantilever structure and the cantilevers are actuated electrostatically. Even 
though the experimental setup reported in this article uses a particular scheme for measuring the 
cantilever detection and for actuating the cantilever, the paradigm developed for data detection 
is largely applicable in principle to other modes of detection and actuation of the cantilever. The 
analysis criteria primarily assume that high quality factor cantilevers are employed and that a 
dynamic mode operation is pursued. 

The article is organized as follows. In Section |Ill background and related work of the probe 
based data storage system is presented. Section [ni] deals with the problem of designing and 
analyzing the data storage unit as a communication system and finding efficient detectors for 
the channel model. Section |IV] and Section |V] report results from simulation and experiment 
respectively. Section |VI] provides the main findings of this article and future work. 

II. Background and related vv'ork. 

Probe based high density data storage devices employ a cantilever beam that is supported at 
one end and has a sharp tip at another end as a means to determine the topography of the media 
on which information is stored. The information on the media is encoded in terms of topographic 
profiles. A raised topographic profile is considered a high bit and a lowered topographic profile 
is considered a low bit. There are various means of measuring the cantilever deflection. In 
the standard atomic force microscope setup, which has formed the basis of probe based data 
storage, the cantilever deflection is measured by a beam-bounce method where a laser is incident 
on the back of the cantilever surface and the laser is reflected from the cantilever surface into 
a split photodiode. The photodiode collects the incident laser energy and provides a measure 
of the cantilever deflection (see Figure [IJa)). The advantage of the beam-bounce method is 
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the high resolution (low measurement noise) and high bandwidth (in the 2-3 MHz) range. The 
disadvantage is that it cannot be easily integrated into an operation where multiple cantilevers 
operate in parallel. There are attractive measurement mechanisms that integrate the cantilever 
motion sensing onto the cantilever itself. These include piezo-resistive sensing [3] and thermal 
sensing 0. For the dynamic mode operation there are various schemes to actuate the cantilever 
that include electrostatic |l5l, mechanical by means of a dither piezo that actuates the support 
of the cantilever base, magnetic lIH and piezoelectric In this article, it is assumed that the 
cantilever is actuated by a dither piezo and the sensing mechanism employed is the beam bounce 
method (see Figure [I^a)). 

A. Models of cantilever probe, the measurement process and the tip-media interaction 

A first mode approximation of the cantilever is given by the spring mass damper dynamics 
described by 

P+^P + ^Ip = fit), y =p + v, (1) 

where p = p,f, y and v denote the deflection of the tip, the force on the cantilever, the 
measured deflection and the measurement noise respectively whereas the parameters ujq and Q are 
the first modal frequency (resonant frequency) and the quality factor of the cantilever respectively. 
The input-output transfer function with input f and output p is given as G = 2i "o -l ^ - The 
cantilever model described above can be identified precisely (see [[T8l ). 

The interaction force, h, between the tip and the media depends on the deflection p of the 
cantilever tip. Such a dependence is well characterized by the Lennard-Jones like force that is 
typically characterized by weak long-range attractive forces and strong short range repulsive 
forces (see Figure [He)). Thus, the probe based data storage system can be viewed as an 
interconnection of a linear cantilever system G with the nonlinear tip-media interaction forces 
in feedback (see Figure [IJb) and note that p = G{h + rj + g) with h = (f){p) lfT9l ). 
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B. Cantilever-Observer Model 

A state space representation of the filter G can be obtained as x = Ax + Bf, y = Cx + v 
where x = [p p]'^ and f = t] + g (assuming no media forces h) and A, B and C are given by, 
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Based on the model of the cantilever, an observer to monitor the state of the cantilever can 
be implemented IfTTI (see Figure [21). The observer dynamics and the associated state estimation 
error dynamics is given by. 



Observer 



State Estimation Error Dynamics 



X 



X = Ax + Bg + L{y - y);x{0) = Xo, 
y = Cx, 



= Ax + B{g + r])-Ax-Bg-L{y-y), 
= {A- LC)T + Br] - Lv, 
W{0) = x{0)-W{0), 

where L is the gain of the observer, x is the estimate of the state x and g is the external known 
dither forcing applied to the cantilever. The error in the estimate is given hy x = x — x, whereas 
the error in the estimate of the output y is given hy, e = y — y = Cx + v. The error between the 
observed state and the actual state of the cantilever, when no noise terms or media forces are 
present (rj = v = h = 0) is, only due to the mismatch in the initial conditions of the observer and 
the cantilever-tip. Note that the cantilever tip interacts with the media only for a small portion 
of an oscillation. It is shown in ifTTl that such a tip-media interaction can be modeled well as an 
impact force (in other words as an impulsive force) on the cantilever that translates into an initial 
condition reset of the cantilever state. The error process is white if the Kalman gain is used for 
L IfTTI . For cantilever deflection sensors with low enough and realizable levels of measurement 
noise, the effective length of the impulse response of the system with media force as input and 
the error signal e as the output can be made as short as four periods of the cantilevers first 
resonant frequency. 

As described in IfTTI . the discretized model of the cantilever dynamics is given by 

Xk+i = Fxk + G{gk + r]k) + Se^k+ii' , Vk = Hxk + Wfe, A; > , (2) 



where the matrices F, G, and H are obtained from matrices A, B and C using the zero order 
hold discretization at a desired sampling frequency and 5i,j denotes the dirac delta function. 6 
denotes the time instant when the impact between the cantilever tip and the media occurs and u 
signifies the value of the impact. The impact results in an instantaneous change or jump in the 
state by u at time instant 6. When a Kalman observer is used, the profile in the error signal due 
to the media can be pre-calculated as, 

Cfc = Vk - ilk = 'Tk-fi V + nk , (3) 

where {r^.g u} is a known dynamic state profile with an unknown arrival time defined by 
Ffc.g = H{F — LkHY"^ , for k > 9. is the Kalman observer gain, is a zero mean 
white noise sequence which is the measurement residual had the impact not occurred and 9 is 
assumed to be equal to for simplicity. The statistics of n are given by, E{njn^} = V5jk where 
V = HP^H^ + R and P§ is the steady state error covariance obtained from the Kalman filter 
that depends on P and R which are the variances of the thermal noise and measurement noise 
respectively. 

in. Channel model and detectors 

A. Reformulation of state space representation 

It is to be noted that although we have modeled the cantilever system as a spring-mass-damper 
model (second order system with no zeros and two stable poles)(see ©), the experimentally 
identified channel transfer function that is more accurate in practice has right half plane zeros 
that are attributed to delays present in the electronics. Given this scenario, the state space 
representation used in [[T71 leads to a discrete channel with two inputs as seen in ([3]) because the 
structure of B is no longer in the form of [0 1]^. However, source information enters the channel 
as a single input as the tip-medium interaction force. The problem can be reformulated as one of 
a channel being driven by a single input by choosing an appropriate state space representation. 
For the state space model of the cantilever, it is known that the pair {A, B) is controllable 
which implies there exists a transformation which will convert the state space into a controllable 
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canonical form such that B = [0 1]'^. This kind of structure of B will force the discretized 
model © to be such that one component of is equal to 0. With B chosen as above, the entire 
system can be visualized as a channel that has a single source. In this article, the single source 
model is used as it simplifies the detector structure and analysis substantially. 
B. Channel Model 

The cantilever based data storage system can be modeled as a communication channel as 
shown in Figure |3l The components of this model are explained below in detail. 
Shaping Filter {h(t)): The model takes as input the bit sequence a = (oq, ai . . . aAr_i) where 
ttk, k = 1, . . . , N — 1 is equally likely to be or 1. In the probe storage context, '0' refers to the 
topographic profile being low and '1' refers to the topographic profile being high. Each bit has a 
duration of T seconds. This duration can be found based on the length of the topographic profile 
specifying a single bit and the speed of the scanner. The height of the high bit is denoted by A. 
The cantilever interacts with the media by gently tapping it when it is high. When the media 
is low, typically no interaction takes place. We model the effect of the medium height using a 
filter with impulse response b(t) (shown in Figure [3]) that takes as input, the input bit impulse 
train a(t) = J2k=o ^k^i't ~ kT). The output of the filter is given by a{t) = J2k=o ^kK't — kT). 
Nonlinearity Block (0): The cantilever oscillates at frequency /o which means that in each 
cantilever cycle of duration Tc = 1/ fo, the cantilever hits the media at most once if the media is 
high during a time Tc. Due to the dynamics of the system it may not hit the media, even if it is 
high. The magnitude of impact on the media is not constant and changes according to the state 
of the cantilever prior to the interaction with the media. We note that a very accurate modeling of 
the cantilever trajectory will require the solution of complex nonlinear equations corresponding 
to the cantilever dynamics and knowledge of the bit profile so that each interaction is known. In 
this work we model the impact values of the tip-media interaction by means of a probabilistic 
Markov model that depends on the previous bits. This obviates the need for a detailed model. 
We assume that in each high bit duration T, the cantilever hits the media q times (i.e. T = qT^) 
with varying magnitudes. Therefore, for N bits, the output of the nonlinearity block is given by, 
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a(t) = J2k=o^ i^k{(i)^{t—kTc), where Uk denotes the magnitude of the k^^ impact of the cantilever 
on the medium. Here, we approximate the nonlinearity block output as a sequence of impulsive 
force inputs to the cantilever. The strength of the impulsive hit at any instant is dependent on 
previous impulsive hits; precisely because the previous interactions affect the amplitude of the 
oscillations that in turn affect how hard the hit is at a particular instance. The exact dependence 
is very hard to model deterministically and therefore we chose a Markov model, as given below 
for the sequence of impact magnitudes for a single bit duration, 

z/i = 0(aj,ai_i, . . . ,ai_^) + bj (4) 

where z/j = [vig z/j^+i . . . z/(j4.i)g„i]^ and (S(ai, aj_i, . . . , aj_m) is a function of the current and 
the last m bits. Here m denotes the system memory and hi is a zero mean i.i.d. Gaussian vector 
of length q. The appropriateness of the model will be demonstrated by our experimental results. 
Channel Response (r(t)): The Markovian modeling of the output of the nonlinearity block as 
discussed above allows us to break the feedback loop in Figure [21 (see also The rest of 

the system can then be modeled by treating it as a linear system with impulse response T{t). 
V{t) is the error between the cantilever tip deflection and the tip deflection as estimated by the 
observer when the cantilever tip is subjected to an impulsive force. It can be found in closed 
form for a given set of parameters of cantilever-observer system (see ([3])). 
Channel Noise {n{t)): The measurement noise (from the imprecision in measuring the cantilever 
position) and thermal noise (from modeling mismatches) can be modeled by a single zero mean 
white Gaussian noise process (n(t)) with power spectral density equal to V . 
The continuous time innovation output e(t) becomes, e(t) = i/(a)) + n(t), where ^{a)) = 
J2k=o^ ^k{o)^{t — kTc) and z/(a) = {yQ{a), yi{a) . . . yNq-iio))- The sequence of impact values 
Ui is assumed to follow a Markovian model as explained above, V{t) is the channel impulse 
response and n{t) is a zero mean white Gaussian noise process. 
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C. Sufficient Statistics for Channel model 

Before providing sufficient statistics we consolidate the notation used. The source stream is 
elements long (a denotes the sequence of source bits), with the topographic profile and the 
scan speed is chosen such that the cantilever impacts any topographic profile q times. Thus there 
are Nq possible hits with via) denoting the sequence of strength of the Nq impulsive hits on 
the cantilever. Furthermore, the set of strengths of impulsive force inputs, which is q elements 
long, during the z*'* topographic profile encoding the i^^' source symbol is denoted by Ui. Given 
the probabilistic model on u and finite bit sequence (a), an information lossless decomposition 
of e(t) by expansion over an orthonormal finite-dimensional basis with dimension N can be 
achieved where N orthonormal basis functions span the signal space formed by ^{a)). The 
components of e{t) over N orthonormal basis functions are given by, c = s(p(a)) + n, where 
c = (co, ci . . . c^), s{v{a)) = (so, si . . . s^), n = (rio, ni . . . n^) and n ~ iV(0, VIj^^j^) 
where /^xat stands for N x N identity matrix [[TOl . The maximum likelihood estimate of the 
bit sequence can be found as a = argmaXag{o,i}iv /(e|a) where a = (oq, cli . . . un-i) is the 
estimated bit sequence and / denotes a pdf. The term /(e|a) can be further simplified as, 

/(e|a)= / fi-c\a,u)f{u\a)du= [ exp[±:|M^]/(z/|a)rfi. 

= ^exp— — / exp[ — ]f{'y\a)du 

{27rV)- J- 

where denotes Euclidean norm, /(e|a, v) and f{i'\a) denote the respective conditional pdf's 
and z/ = (z/q, ui . . . VNq-i)- The correlation between e and s(z/(a)) can be equivalently expressed 
as an integral over time because of the orthogonal decomposition procedure i.e. z^s{u{a)) = 
IZc i'{d))dt = u^z', where z/ = (z^o, ■ ■ ■ t^Nq-i), z' = {z'q, z[ . . . z'j^^^^) and = 

/^e(t)r(t - kTc)dt for < k < Nq - 1 is the output of a matched filter T{-t) with input 
e{t) sampled at t = kTc. The term /(e|a) can now be written as, 

= w 2V / 2V exp-^f{u\a)du 



{27rVy 
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S{z'\a) 



So fit\d) can be factorized into f)(e) (dependent only on e) and '^{z'\a) (for a given d dependent 
only on z'). Using the Fisher-Neyman factorization theorem [2], we can claim that z' is a vector 
of sufficient statistics for the detection process i.e. -jjjlj^ = C, where C is a constant independent 
of d. So we can reformulate the detection problem as, a = argmaXag{o,i}iv f{z'\d) which means 
that bit detection problem depends only on the matched filter outputs (z'). These matched filter 
outputs for < A; < Nq — 1 can be further simplified as, 2;^ = J2ki=o ^^i + ^fe' where 

h'^_^^ = /^r(t - kT^)T{t - kiT^)dt and < = J^^n{t)T{t - kT^)dt such that E{n'^^n'y) = 

JZoI-ooE{n{t)n{T))T{t-kT,)T{T-k'T,)dtdT = VR,.k>, whore Rk-k' = JZo^{t-kT,)T{t- 
k'Tc)dt. A whitening matched filter can be determined to whiten output noise n'^ [[TOl . We shall 
denote the discretized output of whitened matched filter shown in Figure |4] as Zk, such that 
Zk = J2ii=o ^k-ki{d)hki + rifc, where the filter {hk}k=o,i,...,i denotes the effect of the whitened 
matched filter and the sequence {n^} represents the Gaussian noise with variance V. 

D. Viterbi Detector Design 

Note that the outputs of the whitened matched filter z, continue to remain sufficient statistics 
for the detection problem. Therefore, we can reformulate the detection strategy as, 

a = arg max fiz\a) = axg max H^n^ /(^il'^i ^o~^) (5) 

ae{0,l}^ ae{0,l}^ 

where z = [zq zi . . . z^q-iY , Zi is the received output vector corresponding to the i^^ input 
bit, i.e., Zi = [zig Zig+i . . . and Zq~^ = [zq zf . . . zf_i\^ . In our model, the channel 

is characterized by finite impulse response of length I i.e. hi = for ? < and i > I and 
we assume that / < mjq i.e. the inter-symbol-interference (ISI) length in terms of q hits is 
equal to m/. Let m be the system memory (see dH)). The length of channel response is known 
which means that rrij is known but the value of m cannot be found because it depends on the 
experimental parameters of the system. In the experimental results section, we describe how we 
find the value of m from experimental data. The received output vector Zi can now be written 
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as, 



hi . 

hi 



ho . 
. ho 








\ ^ u- i\ 



\ . . hi . . hoj yz/(i+i)5_i y 
where Ui = [uiq uiq+i . . . z>^_^^ = ■ ■ ■ uJY and = [mq ni+iq . . . n(i+i)g_i]^. 

Our next task is to simplify the factorization in ([5]) so that decoding can be made tractable. 
We construct the dependency graph of the concerned quantities which is shown in Figure [51 
Using the Bayes ball algorithm liSOl . we conclude that 



7; 



fi^i—mi l^^) ^0 



i-1 



f {^i~mi\(^0 1^0 )i 



£{— I — 2— A:— 1 — — 2— 1\ x/— \—i—k—\ i—m.T — l i—1 



m' ^0 



i-l 



(6) 
(7) 

^-^\V 1 < A; < m/ - 1, (8) 

(9) 



where ^ 



]. Although the conditional pdf f{ui_kW:^ 



and 



|a, ^o~^) depend on the entire past, we assume that these dependencies are rapidly 
decreasing with increase in past time. This is observed in simulation and experimental data 
as well. For making the detection process more tractable, we make the following assumptions 
on this dependence. 



fil^i-mi\O'0 ) ^0 )^fi^i-rni\0'i-m-mi^'^i-mj) 



(10) 



i.e. the dependence is restricted to only the immediate neighbors in the dependency graph. Using 
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the above assumptions and dependency graph resuks, f{zi\a,ZQ ) can be further simplified as, 

X /(l/.|«lm)^Cn^, (Using dll), mMM ) 

f {^i\^i-mi) f {''^i-mi\0'i-m-mi'> ^i-mi)'^k=l f {^i-k\^i-mi i ^i-k-m^ ^i-k) 

X f{Ha^,.Jdvl.m, (Using mm) 

By defining a state S'j = a-.^^^^+i, this can be further expressed as f{zi\Si, zlzl^j). Again 
using Bayes ball algorithm, we conclude that 

fi^i-mj\^i-2mjy^i-m-mi) ~ f {^i-mi\^i-2mi) i (12) 

-r-r2m/— 1 r(— I— i— 2m/+fc— 1 i \ -r-rm/— 1 ^z— i — j— 2m/+fc— 1 i \ 

■'-'■fc=l /l^i-2m/+fc|^i_2mj y ^i-m-rrii ) — ^^k=l J \^i-'imi+k\^i-2mi ^ ^i-m-m.i ) 

772771/ — 1 i'/— I 7 — 277l/ + fc \ /I ox 

X iifc=mj J['^i-2mi+k\ai_2mj+k-m)^ y^^) 

The pdf of zl_^^ = [zf_„^^ . . . zJY given current state Si and previous state Si^i is given by, 

/(^7-7r7j I'S'i; 'S'j-i) = |aj_^_^^) = j f{^i-mi\^i-2mi-:'^i-m-mi)f{^i-2mi\^i-m-m.i)'^^i-2mi 

/f ( —i 1—7 7 N I 7 \T-r2m7 — 1 f(— \—i—2mi+k—l i \ 

J \^i~miWi~2mi^ ^i~m-mi ) J K^i-'^mi {(^i^m-mr ) ^^k=l J \^i-'imi+k\^i-2mi 5 ^i~m.~-mi ) 

^ /('^i 1^^1-27777' '^1-777 - 777/ )'^C2m/ ^ y Z (^777/ I ^^1-27777 ) / ( ^^7-2777/ | ai-777 - 777j ) ^^fc!^! / ( ^^7-2777/ + *; | 

C2:r'"\aU-777jn^:-//(z7,_2777,+.ia::^:::;:t„)/(^ el,!!!!]),©!)) 

where the last step is obtained using results from dependency graph and all the terms in the 
last step except f {ui^2mi\al_.^_^^) and W^^"^ f{ui.2mj+k\i^lZ2T,^''~\ai_^_^^) are Gaussian 
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-mj 


\Si, Si^i) 


/(4 


-1 
-mj 


\Si, Si—i) 



distributed. This implies that the pdf of given (Si, Si^i) is not exactly Gaussian distributed. 
If the number of states in the detector is increased it can be modeled as a Gaussian which means 
that the term like f {i^i-2mi\<^i-m-mi) made Gaussian distributed by increasing the number 

of states, but this increases the complexity. In order to keep the decoding tractable we make the as- 
sumption that f{zl_^^\Si,Si^i) is Gaussian i.e. /(z-.^JSi, Si^i) ~ N {y{Su Si^i),C{Su Si^i)), 
where y{Si, Si^i) is the mean and C{Si, Si^i) is the covariance. With our state definition, we can 
reformulate the detection problem as a maximum likelihood state sequence detection problem UJ, 

S = argmax f{z\S) = argmax n^Q"*^ f{zi\S, zq . . . 

all S all S 

= argmax lifs^ /(zij^i, Si-i, z]Z^J = argmax H^Jq 

all S all S 

= argmin E [l"s( f ' c ) + (^--. ' ^(^- S..,)fC{S., S,.,)-' 

all S \C[Oi^ 

X (zU, - - {z^^ - S..^)fc{S„ S,.^-\zti, - y(^., ^.-i))] 

where S is estimated state sequence, c{Si, Si-i) is the upper nijq x nijq principal minor of 
C{Si,Si-i) and y{Si,Si-i) collects the first mjq elements of y{Si, Si-i). It is assumed that 
the first state is known. With metric given above, Viterbi decoding can be applied to get the 

maximum likelihood state sequence and the corresponding bit sequence. 
E. LMP, GLRT and Bayes Detector 

In ifTTl . the hit detection algorithm is proposed which ignores the modeling of channel memory 
and works well only when the hits are sufficiently apart. In [[T2l . various detectors for hit detection 
like locally most powerful (LMP), generalized likelihood ratio test (GLRT) and Bayes detector 
are presented. These detectors also ignore the system memory and perform detection of single 
hits. Subsequently a majority type rule is used for bit detection. The continuous time innovation 
(e(t)) is sampled at very high sampling rate l/T^ such that T, << T^. As the channel response 
(r(t)) is finite length, the sampled channel response is assumed to have the finite length equal 
to M. The sampled channel response is given by. 

To = [r(t)|,=o m\t=T. . . . r(t)|,=(M-i)Tj^ 
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Determining when the cantilever is "hitting" the media and when it is not, is formulated as a 
binary hypothesis testing problem with the following hypotheses, 

Hq : e = n, Hi : e = TqU + n 

where the sampled innovation vector e = [ei 62 ■ ■ ■ euY , n = [ni n2 ■ ■ ■ um]'^, Tq is the sampled 
channel response, u signifies the value of the impact on media and VImxm denotes the covariance 
matrix of n where Imxm stands for M x M identity matrix. In case of locally most powerful 
(LMP) test given in |fT6l , the likelihood ratio is given by lfT2l . 

f{e\Ho) 

where kmp denotes likelihood ratio for LMP. In our model, there are q number of hits in one 
bit duration. Let lk,imp be the likelihood ratio corresponding to k^'^ hit. The decision rule for the 
detection of one bit in this case is defined as, 

Max(^h,imp{M) , kimpiM) . . . lg,impiM)^ ^1 n (15) 



where ri is LMP threshold. The likelihood ratio in the case of GLRT is IIT2B . 



J[e\-tio) 

where i) is maximum likelihood (ML) estimate of v i.e. z> = argmax,^ f{e\Hi), li^p and Igirt are 
likelihood ratios for LMP and GLRT case respectively. The decision rule for the bit detection 
in this case is defined in a similar manner given in (fTSl) . 

Simulations from a Simulink model of the system can be run for a large number of hits in 
order to gather statistics on the discretized output of nonlinearity block which models the tip- 
media force. We modeled the statistics for z/ by a Gaussian pdf with the appropriate mean and 
variance. With known mean and variance of the likelihood ratio for Bayes test is [[Till . 



hayesiM) = log = 6^1^" V + ^e^V^'e - e^V^ V, 



where fi' = Toa and V = ° ° — and u ~ N{a, A^). The decision rule in this case is also 

i^+^^o To) 



defined in a similar manner given in ( 1151 ). Note that z/ is a measure of the tip-medium interaction 

15 



force and as such it is difficult to experimentally verify the value of this force accurately which 
means the Bayes test cannot be applied for the bit detection on actual experimental data. 

IV. Simulation Results 
We performed simulations with the following parameters. The first resonant frequency of the 

cantilever /o = 63.15 KHz, quality factor Q =206, the value of forcing amplitude equal to 24 nm, 
tip-media separation is 28 nm, the number of hits in high bit duration is equal to 13 i.e. q = 13, 
discretized thermal and measurement noise variance are 0.1 and 0.001 respectively. A Kalman 
observer was designed and the length of the channel impulse response (/) was approximately 
24 which means that m/ is equal to 2. We set the value of the system memory, m = 1. Using 
a higher value of m results in a more complex detector. We used a topographic profile where 
high and low regions denote bits '1' and '0' respectively and the bit sequence is generated 
randomly. The simulation was performed with the above parameters using the Simulink model 
that mimics the experimental station that provides a qualitative as well as a quantitative match 
to the experimental data. Tip-media interaction was varied by changing the height of media 
corresponding to bit '1'. We define the system SNR as the nominal tip-media interaction (nm) 
divided by total noise variance. 

In Figure |6l we compare the results of four different detectors. The LMP, GLRT and Bayes 
detector perform hit detection, as against bit detection. In these detectors, the system memory is 
not taken into account. It is clear that the minimum probability of error for all detectors decreases 
as the tip-media interaction increases which makes SNR higher. The intuition behind this result 
is that hits become harder on media if tip-media interaction is increased which makes detection 
easier. The Viterbi detector gives best performance among all detectors because it incorporates 
the Markovian property of u in the metric used for detection. At an SNR of 10.4 dB the Viterbi 
detector has a BER of 3 x 10"^ as against the LMP detector that has 7 x 10"^. 

V. Experimental Results 

In experiments, a cantilever with resonant frequency /o = 71.78 KHz and quality factor 
Q = 67.55 is oscillated near its resonant frequency. A freshly cleaved mica sheet is placed on 
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top of a high bandwidth piezo. This piezo can position the media (mica sheet) in z-direction with 
respect to cantilever tip. A random sequence of bits is generated through an FPGA board and 
applied to the z-piezo. High level is equivalent to 1 V and represents bit '1' and low level is V 
and represents bit '0' thus creating a pseudo media profile of 6 nm height. The bit width can be 
changed using FPGA controller from 60 — 350 /is. The tip is engaged with the media at a single 
point and its instantaneous amplitude in response to its interaction with z piezo is monitored. 
The controller gain is kept sufficiently low such that the operation is effectively in open loop. 
The gain is sufficient to cancel piezo drift and maintain a certain level of tip-media interaction. 
An observer is implemented in another FPGA board which is based on the cantilever's free air 
model and takes dither and deflection signals as its input and provides innovation signal at the 
output. The innovation signal is used to detect bits by comparing various bit detection algorithms. 
The experiments were performed on Multimode AFM, from Veeco Instruments. Considering a 
bit width of 40 nm and scan time of 60 fis gives a tip velocity equal to 2/3 x 10"^ m/sec. The 
total scan size of the media is 100 micron which means the cantilever will take 0.15 seconds to 
complete one full scan. Read scan speed for this operation is 6.66 Hz. The read scan speed for 
different bit widths can be found in a similar manner. 

The cantilever model is identified using the frequency sweep method wherein excitation 
frequency u of g{t) = Aosmcut of dither piezo is varied from — 100 KHz and p{t) is 
recorded. Magnitude and phase information about G{iuj) is obtained by evaluating the ratios 
between steady state amplitude and phase of output vs input excitation respectively. A second 
order transfer function is obtained that best fits the experimentally identified magnitude and phase 
responses of the cantilever. A, B and C matrices are obtained from the state space realization 
of the identified second order transfer function. F, G and H can be further found using the 
zero order hold discretization at a desired sampling frequency. The discretized state space of the 
cantilever model is used to find the discretized channel impulse response r^.^ (see dS])). 

For 300 jjs bit width, there are around 21 hits in high bit duration and Viterbi decoding 
is applied on the innovation signal obtained from experiment. For experimental model, / is 
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approximately 24 which means rrij is equal to 2. It is hard to estimate the system memory 
(m) from experimental parameters. Fortunately, there is a way around for this. As shown in 
the derivation of the detector, by making appropriate approximations, the final detector only 
requires the mean and the covariance of each branch in the trellis. These can be found by using 
training data and assuming various values of m. We have varied m from to 2 and found the 
corresponding BER using these values of m. The total number of states in the Viterbi detector 
is 2"*+™^. We have observed that for m > 1, the improvement in BER is quite marginal as 
compared to the increased complexity of Viterbi decoding. Accordingly we are using m = 1 
for which the BER from Viterbi decoding is equal to 1 x 10^^ whereas the BER from LMP 
test is 0.26. The BER in the case of Viterbi decoding is significantly smaller when compared to 
the BER for usual thresholding detectors. If the bit width is decreased to 60 fis which means 
there are around 4 hits in the high bit duration, the BER for Viterbi decoding is 7.56 x 10^^ 
whereas the BER for LMP is 0.49 which means that LMP is doing almost no bit detection. As 
the bit width is decreased, there is more ISI between adjacent bits which increases the BER. 
The BER for different bit widths from all the detectors is shown in Figure [81 It can be clearly 
seen that Viterbi decoding gives remarkable results on experimental data as compared to the 
LMP detector. The Viterbi detector exploits the cantilever dynamics by modeling the mean and 
covariance matrix for different state transitions. We have plotted the mean vectors for 2 state 
transitions with 300 fis bit width in Figure Ul There are around 21 hits in one bit duration. 
The Viterbi decoding contains 8 states and 16 possible state transitions. In Figure |7l there is a 
clear distinction in mean vectors for different transitions which makes the Viterbi detector quite 
robust. Thresholding detectors like LMP and GLRT perform very badly on experimental data. 
For a bit sequence like '000011111', the cantilever gets enough time to go into steady state in 
the beginning and hits quite hard on media when bit '1' appears after a long sequence of '0' 
bits. The likelihood ratio for LMP and GLRT rises significantly for such high bits which can 
be easily detected through thresholding. However, a sequence of continuous '1' bits keeps the 
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cantilever in steady state with the cantilever hitting the media mildly which means the likelihood 
ratio remains small for these bits. Thus it is very likely that long sequence of '1' bits will not 
get detected by threshold detectors. 

VI. Conclusions and future work 
We presented the dynamic mode operation of a cantilever probe with a high quality factor and 
demonstrated its applicability to a high-density probe storage system. The system is modeled as 
a communication system by modeling the cantilever interaction with media. The bit detection 
problem is solved by posing it as a ML sequence detection followed by Viterbi decoding. The 
main requirements for the proposed algorithm are (a) the availability of training sequences which 
can provide the statistics for different state transitions, (b) differences between the tip-media 
interaction magnitude between '0' and '1' bit and (c) an accurate characterization of the linear 
model of the cantilever in free air. Simulation and experimental results show that the Viterbi 
detector outperforms LMP, GLRT and Bayes detector and gives remarkably low BER. The work 
reported in this article demonstrates that competitive metrics can be achieved and enables probe 
based high density data storage, where high quality factor probes can be used in the dynamic 
mode operation. Thus, it alleviates the issues of media and tip wear in previous probe based 
data storage systems. 

An efficient error control coding system is a must for any data storage system since the sector 
error rate specifications are on the order of 10^^° for systems in daily use such as hard drives. 
In future work, we are expecting to achieve this BER by using appropriate coding techniques. 
Using run-length-limited (RLL) codes in our system is likely to improve performance and we 
shall examine this issue in future work. We are also working on a BCJR version of the algorithm 
to minimize the BER of the system even further. In experimental data, a small amount of jitter 
is inevitably present which is well handled by our algorithm. At high densities, the jitter will be 
significantly higher and we will need to apply more advanced modeling and detection techniques. 
These are part of ongoing and future work. 
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Fig. 1. (a) Shows the main components of a probe based storage device. The main probe is a cantilever with a tip at one end 
that interacts with the media. The support end can be forced using a dither piezo. The deflection of the tip-end is measured by 
a laser-mirror-photodiode arrangement. The controller employs the deflection measurement to keep the probe engaged with the 
media, (b) Shows a block diagram representation of the cantilever system G being forced by white noise (rj), tip-media force 
h and the dither forcing g. The output of the block G, the deflection p is corrupted by measurement noise v that results in the 
measurement y. Tip media force h = (c) Shows the typical tip-media interaction forces of weak long range attractive 

forces and strong repulsive short range forces. 




Fig. 2. An observer architecture for the system in Figure [TJb) 
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Fig. 3. Continuous time channel model of probe based data storage system 
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Fig. 6. Comparison of various detectors for simulation data. The Bayes curve is not visible in the graph as it coincides with 
the LMP curve. 
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Fig. 7. Mean vector for 2 state transitions for 300 /is bit width from experimental data where '1100' and '1101' represents 
transition from state '110' to state '100' and '101' respectively 
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Fig. 8. BER for Viterbi, LMP and GLRT for different bit widths varying from 60 fis to 300 fis for experimental data. Tfiere 
is a very marginal difference between LMP and GLRT curve which is not visible in the graph but LMP does perform better 
than GLRT. 
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